
Survey Crawls

Survey crawls are run about twice a year, on average, and attempt to capture the content of the front page of every web host ever seen by the Internet Archive since 1996.

95,942
RESULTS


PART OF
Internet Archive Web Crawls
Media Type: collections (15), web (95,916), data (11)
Year: 2019 (5,112), 2018 (12,195), 2017 (13,442), 2016 (14,424), 2015 (20,654), 2014 (12,579)
Topics & Subjects: crawldata (95,916), webcrawl (2), net (1), webwidecrawl (1)
Creator: internet archive (95,187)
collection
969.4M views
The seed for this crawl was a list of every host in the Wayback Machine. This crawl was run at level 1 (URLs including their embeds, plus the URLs of all outbound links including their embeds). The WARC files associated with this crawl are not currently available to the general public.
collection
654.1M views
The seed for this crawl was a list of every host in the Wayback Machine. This crawl was run at level 1 (URLs including their embeds, plus the URLs of all outbound links including their embeds). The WARC files associated with this crawl are not currently available to the general public.
collection
469.1M views
The seed for this crawl was a list of every host in the Wayback Machine. This crawl was run at level 1 (URLs including their embeds, plus the URLs of all outbound links including their embeds). The WARC files associated with this crawl are not currently available to the general public.
collection
376.2M views
The seed for this crawl was a list of every host in the Wayback Machine. This crawl was run at level 1 (URLs including their embeds, plus the URLs of all outbound links including their embeds). The WARC files associated with this crawl are not currently available to the general public.
Survey Crawl Number 6: Sep 11th, 2017 - running now
collection
13,723
ITEMS
336.4M
VIEWS
The seeds for this crawl came from:
  - 251 million domains that had at least one link from a different domain in the Wayback Machine, across all time
  - ~300 million domains that we had in the Wayback Machine, across all time
  - 55,945,067 domains from https://archive.org/details/wide00016
This crawl was run with a Heritrix setting of "maxHops=0" (URLs including their embeds). The WARC files associated with this crawl are not currently available to the general public.
collection
321.3M views
The seed for this crawl was a list of every host in the Wayback Machine. This crawl was run at level 1 (URLs including their embeds, plus the URLs of all outbound links including their embeds). The WARC files associated with this crawl are not currently available to the general public.
.com survey started January 2011
collection
2,535
ITEMS
213.6M
VIEWS
Survey crawl of .com domains started January 2011.
Topic: webcrawl
Survey Crawl Number 7
collection
6,605
ITEMS
197.2M
VIEWS
This "Survey" crawl was started on Feb. 24, 2018. It was run with a Heritrix setting of "maxHops=0" (URLs including their embeds). Survey 7 is based on a seed list of 339,249,218 URLs: all the URLs in the Wayback Machine from which we saw a 200 response code in 2017, based on a query we ran on Feb. 1st, 2018. The WARC files associated with this crawl are not currently available to the general public.
collection
110.4M views
The seed for this crawl was a list of every host in the Wayback Machine. This crawl was run at level 1 (URLs including their embeds, plus the URLs of all outbound links including their embeds). The WARC files associated with this crawl are not currently available to the general public.
Survey Crawl Number 8
collection
7,918
ITEMS
44.4M
VIEWS
COM Survey Crawl 2009-2010
collection
729
ITEMS
39.7M
VIEWS
COM survey crawl data collected by Internet Archive in 2009-2010. This data is currently not publicly accessible.
survey_net00000
collection
300
ITEMS
27.5M
VIEWS
Survey crawl of .net domains started December 2010.
Topic: webcrawl
ORG Survey Crawls
collection
191
ITEMS
16.4M
VIEWS
Survey of .org domains. This data is currently not publicly accessible.
survey_net00001
collection
170
ITEMS
10.5M
VIEWS
Survey crawl of .net domains started October 2011.
Topics: webwidecrawl, net
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl835.us.archive.org:survey from Sun Jan 10 18:55:42 PST 2016 to Sun Jan 10 11:20:26 PST 2016.
Topic: crawldata
Survey Crawl Number 7
web
2M views, 0 favorites, 0 comments
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl-hq10.us.archive.org:survey from Sat Feb 24 03:26:41 PST 2018 to Fri Feb 23 19:55:17 PST 2018.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl428.us.archive.org:survey from Thu Jan 8 02:39:52 PST 2015 to Thu Jan 8 02:17:55 PST 2015.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl344.us.archive.org:survey from Thu Oct 12 08:48:34 PDT 2017 to Thu Oct 12 01:56:31 PDT 2017.
Topic: crawldata
Survey Crawl Number 8
web
1.6M views, 0 favorites, 0 comments
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl818.us.archive.org:survey from Fri Dec 14 21:00:08 PST 2018 to Fri Dec 14 22:34:06 PST 2018.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl420.us.archive.org:survey from Mon Dec 23 10:49:15 PST 2013 to Mon Dec 23 05:58:22 PST 2013.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl452.us.archive.org:survey from Sun Jan 25 16:21:37 PST 2015 to Sun Jan 25 09:51:07 PST 2015.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl422.us.archive.org:survey from Sat Dec 20 22:29:58 PST 2014 to Sat Dec 20 16:52:21 PST 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl429.us.archive.org:survey from Fri Jan 17 02:48:53 PST 2014 to Thu Jan 16 23:21:44 PST 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl336.us.archive.org:survey from Fri Jan 17 01:34:59 PST 2014 to Thu Jan 16 20:57:28 PST 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl419.us.archive.org:survey from Thu Jan 16 23:17:39 PST 2014 to Thu Jan 16 20:04:33 PST 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl454.us.archive.org:survey from Thu Dec 12 08:45:40 PST 2013 to Thu Dec 12 01:43:13 PST 2013.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl804.us.archive.org:survey from Mon Feb 22 06:57:25 PST 2016 to Mon Feb 22 01:01:29 PST 2016.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl804.us.archive.org:survey from Fri Dec 19 15:17:43 PST 2014 to Fri Dec 19 08:41:03 PST 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl419.us.archive.org:survey from Thu Dec 19 09:08:50 PST 2013 to Thu Dec 19 02:00:44 PST 2013.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl423.us.archive.org:survey from Sun Dec 21 20:51:55 PST 2014 to Sun Dec 21 16:00:10 PST 2014.
Topic: crawldata
.com survey started January 2011
web
826,150 views, 0 favorites, 0 comments
Internet Archive crawldata from COM zone survey, captured by crawl309.us.archive.org:com from Wed Feb 23 14:38:26 PST 2011 to Wed Feb 23 08:38:53 PST 2011.
Topic: crawldata
Survey Crawl Number 7
web
812,777 views, 0 favorites, 0 comments
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl840.us.archive.org:survey from Tue May 15 20:39:06 PDT 2018 to Tue May 15 21:00:30 PDT 2018.
Topic: crawldata
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
792,439 views, 0 favorites, 0 comments
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl339.us.archive.org:survey from Sat Sep 30 04:27:26 PDT 2017 to Fri Sep 29 21:56:42 PDT 2017.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl416.us.archive.org:survey from Sun Dec 29 12:12:47 PST 2013 to Sun Dec 29 06:16:27 PST 2013.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl429.us.archive.org:survey from Sun Dec 29 14:04:04 PST 2013 to Sun Dec 29 07:52:39 PST 2013.
Topic: crawldata
Survey Crawl Number 7
web
734,758 views, 0 favorites, 0 comments
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl841.us.archive.org:survey from Mon May 14 17:51:22 PDT 2018 to Mon May 14 16:01:52 PDT 2018.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl427.us.archive.org:survey from Wed Dec 31 23:34:59 PST 2014 to Wed Dec 31 21:31:58 PST 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl335.us.archive.org:survey from Thu Jun 5 00:24:07 PDT 2014 to Wed Jun 4 22:31:50 PDT 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl338.us.archive.org:survey from Wed Dec 31 23:29:37 PST 2014 to Wed Dec 31 21:16:42 PST 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl425.us.archive.org:survey from Thu Jan 1 00:34:54 PST 2015 to Thu Jan 1 01:57:10 PST 2015.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl426.us.archive.org:survey from Thu Jan 1 07:23:56 PST 2015 to Thu Jan 1 05:20:34 PST 2015.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl423.us.archive.org:survey from Thu Jan 29 19:18:01 PST 2015 to Thu Jan 29 11:53:35 PST 2015.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl453.us.archive.org:survey from Sun May 18 19:10:35 PDT 2014 to Sun May 18 13:22:44 PDT 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl429.us.archive.org:survey from Thu Feb 4 02:49:00 PST 2016 to Wed Feb 3 20:18:27 PST 2016.
Topic: crawldata
Data crawled by Internet Archive on behalf of Internet Archive from Sat Jan 02 00:17:41 PDT 2010 to Sat Jan 02 00:40:59 PDT 2010
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl811.us.archive.org:survey from Wed Feb 24 03:27:27 PST 2016 to Wed Feb 24 01:58:44 PST 2016.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl452.us.archive.org:survey from Tue Jul 23 20:44:16 PDT 2013 to Tue Jul 23 15:03:33 PDT 2013.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl457.us.archive.org:survey from Thu May 14 12:28:39 PDT 2015 to Thu May 14 06:26:41 PDT 2015.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl801.us.archive.org:survey from Mon Jan 26 09:41:07 PST 2015 to Mon Jan 26 15:41:07 PST 2015.
Topic: crawldata
.com survey started January 2011
web
589,476 views, 0 favorites, 0 comments
Internet Archive crawldata from COM zone survey, captured by crawl309.us.archive.org:com from Tue Feb 22 20:36:54 PST 2011 to Tue Feb 22 14:45:19 PST 2011.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl425.us.archive.org:survey from Tue Dec 30 01:55:11 PST 2014 to Tue Dec 30 00:46:52 PST 2014.
Topic: crawldata
Survey Crawl Number 7
web
581,087 views, 0 favorites, 0 comments
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl847.us.archive.org:survey from Sun Mar 4 03:42:49 PST 2018 to Sat Mar 3 21:34:41 PST 2018.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl428.us.archive.org:survey from Mon Dec 29 19:10:21 PST 2014 to Mon Dec 29 15:19:16 PST 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl453.us.archive.org:survey from Tue Dec 30 19:27:32 PST 2014 to Tue Dec 30 16:19:14 PST 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl428.us.archive.org:survey from Mon Dec 29 15:42:02 PST 2014 to Mon Dec 29 11:10:21 PST 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl336.us.archive.org:survey from Wed Jan 28 02:55:43 PST 2015 to Wed Jan 28 07:20:00 PST 2015.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl803.us.archive.org:survey from Mon Dec 29 16:24:24 PST 2014 to Mon Dec 29 16:12:36 PST 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl338.us.archive.org:survey from Mon Dec 29 19:47:20 PST 2014 to Mon Dec 29 15:22:22 PST 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl336.us.archive.org:survey from Mon Dec 29 17:40:49 PST 2014 to Mon Dec 29 13:50:46 PST 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl805.us.archive.org:survey from Mon Dec 29 17:33:28 PST 2014 to Mon Dec 29 15:25:11 PST 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl804.us.archive.org:survey from Mon Dec 29 16:39:30 PST 2014 to Mon Dec 29 15:21:31 PST 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl809.us.archive.org:survey from Mon Dec 29 20:07:14 PST 2014 to Mon Dec 29 18:18:25 PST 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl426.us.archive.org:survey from Mon Dec 29 17:24:53 PST 2014 to Mon Dec 29 12:36:45 PST 2014.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl419.us.archive.org:survey from Mon Dec 23 06:56:18 PST 2013 to Mon Dec 23 00:11:42 PST 2013.
Topic: crawldata
Survey Crawl Number 7
web
521,007 views, 0 favorites, 0 comments
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl-hq10.us.archive.org:survey from Sat Feb 24 10:06:32 PST 2018 to Sat Feb 24 02:13:22 PST 2018.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl809.us.archive.org:survey from Thu Jan 1 00:35:40 PST 2015 to Thu Jan 1 01:15:39 PST 2015.
Topic: crawldata
.com survey started January 2011
web
483,223 views, 0 favorites, 0 comments
Internet Archive crawldata from COM zone survey, captured by crawl309.us.archive.org:com from Sat Feb 12 04:25:02 PST 2011 to Tue Feb 22 12:28:53 PST 2011.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl838.us.archive.org:survey from Sun Jan 10 18:12:42 PST 2016 to Sun Jan 10 12:15:44 PST 2016.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl453.us.archive.org:survey from Fri May 31 21:15:22 PDT 2013 to Sun Jun 2 06:31:08 PDT 2013.
Topic: crawldata
Data crawled by Internet Archive on behalf of Internet Archive from Sat Jan 02 00:42:27 PDT 2010 to Sat Jan 02 01:13:48 PDT 2010
Topic: crawldata
Survey Crawl Number 7
web
449,592 views, 0 favorites, 0 comments
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl843.us.archive.org:survey from Sun Mar 4 03:11:53 PST 2018 to Sat Mar 3 21:10:41 PST 2018.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl453.us.archive.org:survey from Wed May 22 00:29:45 PDT 2013 to Wed May 29 10:38:37 PDT 2013.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl451.us.archive.org:survey from Sat May 25 23:20:37 PDT 2013 to Sun May 26 23:08:39 PDT 2013.
Topic: crawldata
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl452.us.archive.org:survey from Sun May 26 01:33:43 PDT 2013 to Mon May 27 01:54:34 PDT 2013.
Topic: crawldata
Survey Crawl Number 6: Sep 11th, 2017 - running now
web
433,669 views, 0 favorites, 0 comments
Internet Archive crawldata from Survey Webwide Crawl, captured by crawl344.us.archive.org:survey from Sun Oct 29 08:31:23 PDT 2017 to Sun Oct 29 02:17:27 PDT 2017.
Topic: crawldata